Operator Fusion, Memory Bandwidth, Graph Optimization, Intermediate Elimination

Duality-Based Fixed Point Iteration Algorithm for Beamforming Design in ISAC Systems
arxiv.org·1d
🎯Tensor Cores
Autonomous Anomaly Detection in LiDAR-Based Autonomous Navigation for Jetson AGX Orin
dev.to·17h·
Discuss: DEV
🏎️TensorRT
Stable Video Infinity: Infinite-Length Video Generation with Error Recycling
paperium.net·12h·
Discuss: DEV
Flash Attention
Hidden network preserved in Slide-tags data allows reference-free spatial reconstruction
nature.com·1d
🔀Operator Fusion
Your Transformer is Secretly an EOT Solver
elonlit.com·1d·
Discuss: Hacker News
👁️Attention Optimization
A hitchhiker's guide to CUDA programming
seanzhang.me·1d·
Discuss: Hacker News
🎯GPU Kernels
Kalman Filter Algorithm: Core Principles, Advantages, Applications, and C Code Implementation
devresourcehub.com·11h·
Discuss: DEV
🔀Operator Fusion
Multi-Splitting Forking Based Modular Security of Signatures in Multivariate Quadratic Setting
eprint.iacr.org·3d
🎯Tensor Cores
Best Open Source Observability Solutions
clickhouse.com·1d·
Discuss: Hacker News
🔀Operator Fusion
My first fifteen compilers (2019)
blog.sigplan.org·22h·
Discuss: Hacker News
🚀Compiler Optimization
Contribution-Guided Asymmetric Learning for Robust Multimodal Fusion under Imbalance and Noise
arxiv.org·1d
📉Model Quantization
Reinforcement learning driven adaptive graph construction for fault diagnosis of chemical processes
sciencedirect.com·2d
🔄ONNX
[D] Best (free) courses on neural networks
reddit.com·3h
👁️Attention Optimization
Nirvana: A Specialized Generalist Model With Task-Aware Memory Mechanism
arxiv.org·1d
📊Gradient Accumulation
Symbolic Alchemy: Transmuting Linear Solvers into Lightning Speed by Arvind Sundararajan
dev.to·2d·
Discuss: DEV
🎯Tensor Cores
Graph RAG vs SQL RAG
towardsdatascience.com·5h
ONNX Runtime
Text rendering and effects using GPU-computed distances
blog.pkh.me·2h
✂️CUTLASS
Performance evaluation of image convolution with gradient filters in OpenCL
milania.de·3d·
Discuss: Hacker News
🔢cuBLAS
TinyML is the most impressive piece of software you can run on any ESP32
xda-developers.com·1d
ONNX Runtime